Competing Bandits: Learning Under Competition

Authors

  • Yishay Mansour
  • Aleksandrs Slivkins
  • Zhiwei Steven Wu
Abstract

Most modern systems strive to learn from interactions with users, and many engage in exploration: making potentially suboptimal choices for the sake of acquiring new information. We initiate a study of the interplay between exploration and competition: how such systems balance the exploration for learning and the competition for users. Here the users play three distinct roles: they are customers that generate revenue, they are sources of data for learning, and they are self-interested agents which choose among the competing systems. In our model, we consider competition between two multi-armed bandit algorithms faced with the same bandit instance. Users arrive one by one and choose among the two algorithms, so that each algorithm makes progress if and only if it is chosen. We ask whether and to what extent competition incentivizes the adoption of better bandit algorithms. We investigate this issue for several models of user response, as we vary the degree of rationality and competitiveness in the model. Our findings are closely related to the “competition vs. innovation” relationship, a well-studied theme in economics.

1998 ACM Subject Classification: F.1.1 Models of Computation
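The setup described in the abstract (two bandit algorithms facing the same instance, users arriving one by one, and an algorithm learning only when it is chosen) can be illustrated with a small simulation. This is a minimal sketch, not the authors' model: the epsilon-greedy learners, the Bernoulli rewards, the user rule that compares a sliding-window average of each system's recent rewards, and the names `EpsilonGreedy`, `simulate`, `p_random`, and `window` are all assumptions made for illustration.

```python
import random


class EpsilonGreedy:
    """Illustrative bandit learner; epsilon=0 gives a purely greedy algorithm."""

    def __init__(self, n_arms, epsilon, rng):
        self.epsilon = epsilon
        self.rng = rng
        self.counts = [0] * n_arms   # pulls per arm
        self.sums = [0.0] * n_arms   # total reward per arm

    def mean(self, arm):
        # Neutral default of 0.5 for arms that were never pulled (assumption).
        return self.sums[arm] / self.counts[arm] if self.counts[arm] else 0.5

    def select_arm(self):
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.counts))  # explore
        return max(range(len(self.counts)), key=self.mean)  # exploit

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.sums[arm] += reward


def simulate(arm_means, epsilons, n_users, p_random=0.1, window=50, seed=0):
    """Two learners compete for users on the same Bernoulli bandit instance.

    Hypothetical user response rule: with probability p_random a user picks
    a system uniformly at random; otherwise the user picks the system with
    the higher average reward over its last `window` users.
    """
    rng = random.Random(seed)
    algs = [EpsilonGreedy(len(arm_means), eps, rng) for eps in epsilons]
    recent = [[], []]   # recent rewards per system, used as its reputation
    shares = [0, 0]     # number of users served by each system

    for _ in range(n_users):
        scores = [sum(r) / len(r) if r else 0.5 for r in recent]
        if rng.random() < p_random or scores[0] == scores[1]:
            chosen = rng.randrange(2)
        else:
            chosen = 0 if scores[0] > scores[1] else 1

        # Only the chosen algorithm acts and observes a reward.
        alg = algs[chosen]
        arm = alg.select_arm()
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        alg.update(arm, reward)

        recent[chosen] = (recent[chosen] + [reward])[-window:]
        shares[chosen] += 1

    return shares, algs


if __name__ == "__main__":
    shares, _ = simulate([0.3, 0.7], epsilons=(0.0, 0.1), n_users=2000)
    print("users served by (greedy, exploring):", shares)
```

Lowering `p_random` makes the simulated users more sharply reputation-driven, which is one simple way to vary how strongly the two systems compete; the response models actually studied in the paper may differ.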

Similar resources

Exploiting Competition Relationship for Robust Visual Recognition

Joint learning of similar tasks has been a popular trend in visual recognition and proven to be beneficial. Between-task similarity often provides useful cues, such as feature sharing, for learning visual classifiers. By contrast, the competition relationship between visual recognition tasks (e.g., content independent writer identification and handwriting recognition) remains largely under-expl...

Market power influential approach using game theory in a two competing supply chains with multi-echelons under centralized/decentralized environments

This paper is considering the competition between two multi-echelon supply-chains on price and service under balance and imbalance of market power between the chains which are analyzing through Nash and Stackelberg game approach. The problem is categorized as the centralized or decentralized structure of each chain, which means a few different possible scenarios are developing based on the Nash...

Real-Time Competition Processes in Word Learning

Perceptual processes take time to unfold. Whether a person is processing a visual scene, identifying the category an object belongs to, or recognizing a word, cognitive processes involving competition across time occur. These ongoing competitive processes have been ignored in studies of learning. However, some forms of learning suggest that learning could occur while competition is ongoing, res...

Sequential Monte Carlo Bandits

In this paper we propose a flexible and efficient framework for handling multi-armed bandits, combining sequential Monte Carlo algorithms with hierarchical Bayesian modeling techniques. The framework naturally encompasses restless bandits, contextual bandits, and other bandit variants under a single inferential model. Despite the model’s generality, we propose efficient Monte Carlo algorithms t...

Outsourcing through Three-dimensional Competition

In this paper, we study an outsourced supply chain consisting of one buyer and two suppliers in which the buyer outsources manufacturing of a physical product to two competing suppliers. The suppliers compete for the buyers' demands share, and the buyer allocates the demands to the competing suppliers based on three-dimensional allocation functions. We consider two certain types of allocation f...

Publication date: 2018